A Protein Domain and Family Based Approach to Rare Variant Association Analysis

نویسندگان

  • Tom G. Richardson
  • Hashem A. Shihab
  • Manuel A. Rivas
  • Mark I. McCarthy
  • Colin Campbell
  • Nicholas J. Timpson
  • Tom R. Gaunt
چکیده

BACKGROUND It has become common practice to analyse large scale sequencing data with statistical approaches based around the aggregation of rare variants within the same gene. We applied a novel approach to rare variant analysis by collapsing variants together using protein domain and family coordinates, regarded to be a more discrete definition of a biologically functional unit. METHODS Using Pfam definitions, we collapsed rare variants (Minor Allele Frequency ≤ 1%) together in three different ways 1) variants within single genomic regions which map to individual protein domains 2) variants within two individual protein domain regions which are predicted to be responsible for a protein-protein interaction 3) all variants within combined regions from multiple genes responsible for coding the same protein domain (i.e. protein families). A conventional collapsing analysis using gene coordinates was also undertaken for comparison. We used UK10K sequence data and investigated associations between regions of variants and lipid traits using the sequence kernel association test (SKAT). RESULTS We observed no strong evidence of association between regions of variants based on Pfam domain definitions and lipid traits. Quantile-Quantile plots illustrated that the overall distributions of p-values from the protein domain analyses were comparable to that of a conventional gene-based approach. Deviations from this distribution suggested that collapsing by either protein domain or gene definitions may be favourable depending on the trait analysed. CONCLUSION We have collapsed rare variants together using protein domain and family coordinates to present an alternative approach over collapsing across conventionally used gene-based regions. Although no strong evidence of association was detected in these analyses, future studies may still find value in adopting these approaches to detect previously unidentified association signals.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Identification of a Novel Splice Site Mutation in RUNX2 Gene in a Family with Rare Autosomal Dominant Cleidocranial Dysplasia

Introduction: Pathogenic variants of RUNX2, a gene that encodes an osteoblast-specific transcription factor, have been shown as the cause of CCD, which is a rare hereditary skeletal and dental disorder with dominant mode of inheritance and a broad range of clinical variability. Due to the relative lack of clinical complications resulting in CCD, the medical diagnosis of this disorder is challen...

متن کامل

Protective Properties of Nontoxic Recombinant Exotoxin A (Domain I-II) Against Pseudomonas aeruginosa Infection

Background: Antibiotic resistance and the need for long-term treatments especially for chronic infections necessitate the development <span style="fon...

متن کامل

A generalized least-squares framework for rare-variant analysis in family data

Rare variants may, in part, explain some of the hereditability missing in current genome-wide association studies. Many gene-based rare-variant analysis approaches proposed in recent years are aimed at population-based samples, although analysis strategies for family-based samples are clearly warranted since the family-based design has the potential to enhance our ability to enrich for rare cau...

متن کامل

Whole exome sequencing revealed a novel dystrophin-related protein-2 (DRP2) deletion in an Iranian family with symptoms of polyneuropathy

Objective(s): Charcot-Marie Tooth disease (CMT) is one of the main inherited causes of motor and sensory neuropathies with variable expressivity and age-of onset. Although more than 70 genes have been identified for CMT, more studies are needed to discover other genes involved in CMT. Introduction of whole exome sequencing (WES) to capture all the exons may help to fin...

متن کامل

Functional Investigation of the Novel BRCA1variant (Glu1661Gly) byComputationalTools andYeastTranscription Activation Assay

Introduction: Mutations in the BRCA1 gene are major risk factors for breast and ovarian cancers. However, the relationship between some BRCA1 mutations and cancer risk remains largely unknown. Cancer risk predictions could be improved by evaluation of the impairment degree in the BRCA1 functions due to a specific mutation. This study aimed to assess the functional effect of a novel variant (Glu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 11  شماره 

صفحات  -

تاریخ انتشار 2016